Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Biology (Basel) ; 12(5)2023 May 12.
Artigo em Inglês | MEDLINE | ID: mdl-37237520

RESUMO

An ever-growing amount of accumulated data has materialized in several scientific fields, due to recent technological progress. New challenges emerge in exploiting these data and utilizing the valuable available information. Causal models are a powerful tool that can be employed towards this aim, by unveiling the structure of causal relationships between different variables. The causal structure may avail experts to better understand relationships, or even uncover new knowledge. Based on 963 patients with coronary artery disease, the robustness of the causal structure of single nucleotide polymorphisms was assessed, taking into account the value of the Syntax Score, an index that evaluates the complexity of the disease. The causal structure was investigated, both locally and globally, under different levels of intervention, reflected in the number of patients that were randomly excluded from the original datasets corresponding to two categories of the Syntax Score, zero and positive. It is shown that the causal structure of single nucleotide polymorphisms was more robust under milder interventions, whereas in the case of stronger interventions, the impact increased. The local causal structure around the Syntax Score was studied in the case of a positive Syntax Score, and it was found to be resilient, even when the intervention was strong. Consequently, employing causal models in this context may increase the understanding of the biological aspects of coronary artery disease.

2.
Mol Med ; 29(1): 12, 2023 01 24.
Artigo em Inglês | MEDLINE | ID: mdl-36694130

RESUMO

BACKGROUND: Amyotrophic lateral sclerosis (ALS) is a rare progressive neurodegenerative disease that affects upper and lower motor neurons. As the molecular basis of the disease is still elusive, the development of high-throughput sequencing technologies, combined with data mining techniques and machine learning methods, could provide remarkable results in identifying pathogenetic mechanisms. High dimensionality is a major problem when applying machine learning techniques in biomedical data analysis, since a huge number of features is available for a limited number of samples. The aim of this study was to develop a methodology for training interpretable machine learning models in the classification of ALS and ALS-subtypes samples, using gene expression datasets. METHODS: We performed dimensionality reduction in gene expression data using a semi-automated preprocessing systematic gene selection procedure using Statistically Equivalent Signature (SES), a causality-based feature selection algorithm, followed by Boosted Regression Trees (XGBoost) and Random Forest to train the machine learning classifiers. The SHapley Additive exPlanations (SHAP values) were used for interpretation of the machine learning classifiers. The methodology was developed and tested using two distinct publicly available ALS RNA-seq datasets. We evaluated the performance of SES as a dimensionality reduction method against: (a) Least Absolute Shrinkage and Selection Operator (LASSO), and (b) Local Outlier Factor (LOF). RESULTS: The proposed methodology achieved 85.18% accuracy for the classification of cerebellum or frontal cortex samples as C9orf72-related familial ALS, sporadic ALS or healthy samples. Importantly, the genes identified as the most determinative have also been reported as disease-associated in ALS literature. When tested in the evaluation dataset, the methodology achieved 88.89% accuracy for the classification of sporadic ALS motor neuron samples. When LASSO was used as feature selection method instead of SES, the accuracy of the machine learning classifiers ranged from 74.07 to 96.30%, depending on tissue assessed, while LOF underperformed significantly (77.78% accuracy for the classification of pooled cerebellum and frontal cortex samples). CONCLUSIONS: Using SES, we addressed the challenge of high dimensionality in gene expression data analysis, and we trained accurate machine learning ALS classifiers, specific for the gene expression patterns of different disease subtypes and tissue samples, while identifying disease-associated genes.


Assuntos
Esclerose Lateral Amiotrófica , Doenças Neurodegenerativas , Humanos , Esclerose Lateral Amiotrófica/genética , Aprendizado de Máquina , Marcação de Genes
3.
Stud Health Technol Inform ; 290: 1008-1009, 2022 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-35673179

RESUMO

Within the most recent years, most of the cancer patients are older age, which implies the necessity to a better understanding of aging and cancer connection. This work presents the LifeChamps solution built on top of cutting-edge Big Data architecture and HPC infrastructure concepts. An innovative architecture was envisioned supported by the Big Data Value Reference Model and answering the system requirements from high to low level and from logical to physical perspective, following the "4+1 architectural model".


Assuntos
Sobreviventes de Câncer , Nomes , Neoplasias , Inteligência Artificial , Big Data , Humanos , Inteligência
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...